Using the Annotated Bibliography as a Resource for Indicative Summarization
Abstract
We report on a language resource consisting of 2000 annotated bibliography entries, which is being analyzed as part of our research on indicative document summarization. We show how annotated bibliographies cover certain aspects of summarization that have not been well covered by other summary corpora, and motivate why they constitute an important form to study for information retrieval. We detail our methodology for collecting the corpus and give an overview of the document feature markup that we introduced to facilitate summary analysis. We present the characteristics of the corpus and show its use in finding the distribution of the types of information included in indicative summaries and their relative ordering within the summaries.
Similar resources
Corpus-trained Text Generation for Summarization
We explore how machine learning can be employed to learn rulesets for the traditional modules of content planning and surface realization. Our approach takes advantage of semantically annotated corpora to induce preferences for content planning and constraints on realizations of these plans. We applied this methodology to an annotated corpus of indicative summaries to derive constraint rules th...
Comparison of Different Machine Learning Methods in Extractive Persian Speech-to-Speech Summarization Without Using Transcripts
In this paper, extractive speech summarization using different machine learning algorithms was investigated. The task of speech summarization deals with extracting important and salient segments from speech in order to access, search, extract and browse speech files more easily and at lower cost. In this paper, a new method for speech summarization without using automatic speech recognitio...
Children and Housing Literature Review
Preamble: This overview relates to an annotated research bibliography compiled as part of this project. The bibliography is stored as a RefWorks database on the University of Auckland library databases and is accessible to anyone with a UoA login and can be copied on request to those outside the University. The name and password for this database will be supplied on request. Rather than refer t...
Systematic literature review of fuzzy logic based text summarization
"Information Overload" is not a new term but with the massive development in technology which enables anytime, anywhere, easy and unlimited access; participation & publishing of information has consequently escalated its impact. Assisting users' informational searches with reduced reading surfing time by extracting and evaluating accurate, authentic & relevant information are the primary c...
Enterprise Resource Planning Systems Research: An Annotated Bibliography
Despite growing interest, publications on ERP systems within the academic Information Systems community, as reflected by contributions to journals and international conferences, are only now emerging. This article provides an annotated bibliography of the ERP publications published in the main Information Systems journals and conferences and reviews the state of the ERP art. The publications sur...
Journal title: CoRR
Volume: cs.CL/0206007
Issue: -
Pages: -
Publication date: 2002